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Generalized q-Cramer-Rao inequalities 



1. Introduction 

It is well known that the Gaussian distribution has a central role with respect to classical information 
measures and inequalities. For instance, the Gaussian distribution maximizes the entropy over all 
distributions with the same variance; see [TJ Lemma 5]. Similarly, the Cramer-Rao inequality, e.g. 
[TJ Theorem 20], shows that the minimum of the Fisher information over all distributions with a 
given variance is attained for the Gaussian distribution. Generalized g-Gaussian distributions arise 
as the maximum entropy solution in Tsallis' nonextensive thermostatistics, which is based on the 
use of the generalized Tsallis entropy. The Generalized g-Gaussian distributions also appear in 
other fields, namely as the solution of non-linear diffusion equations, or as the distributions that 
saturate some sharp inequalities in functional analysis. Furthermore, the generalized g-Gaussian 
distributions form a versatile family that can describe problems with compact support as well as 
problems with heavy tailed distributions. 

Since the standard Gaussian is both a maximum entropy and a minimum Fisher information 
distribution over all distributions with a given variance, a natural question is to find whether this 
can be extended to include the case of the generalized g-gaussian distribution, thus improving the 
information theoretic characterization of these generalized g-Gaussians. This question amounts to 
look for a definition of a generalized Fisher information, that should include the standard one as 
a particular case, and whose minimum over all distributions with a given variance is a g-Gaussian. 
More generally, this should lead to an extension of the Cramer-Rao inequality saturated by q- 
Gaussian distributions. 

Several extensions of the Fisher information and of the Cramer-Rao inequality have been 
proposed in the literature. In particular, the beautiful work by Lutwak, Yang and Zhang [5] gives 
an extended Fisher information and a Cramer-Rao inequality saturated by q-Gaussian distributions. 
In an interesting work, Furuichi [3 J defines another generalized Fisher information and a Cramer- 
Rao inequality saturated by g-Gaussian distributions. In this latter contribution, the statistical 
expectations are computed with respect to escort distributions. These escort distributions are 
one parameter deformed versions of the original distributions, and are very useful in different 
formulations of nonextensive statistics. In the present paper, we will recover these results and show 
that the two generalized Fisher informations above and the associated Cramer-Rao inequalities are 
actually linked by a simple transformation. 

In section [2j we give the main definitions and describe the ingredients that are used in the 
paper. In particular, we give the definition and describe the importance of generalized g-Gaussians, 
we define the notion of escort distributions, and finally give some definitions related to deformed 
calculus in nonextensive statistics. 

The Cramer-Rao inequalities indicated above are inequalities characterizing the probability 
distribution, and the Fisher information is the Fisher information of the distribution. Actually, 
the Fisher information is defined in a broader context as the information about a parameter of 
a parametric family of distributions. In the special case of a location parameter, it reduces to 
the Fisher information of the distribution. The Cramer-Rao inequality appears in the context of 
estimation theory, and as it is well known, defines a lower bound on the variance of any estimator 
of a parameter. In section [3J we describe the problem of estimation, recall the classical Cramer-Rao 
inequality, and show that the standard Cramer-Rao inequality can be extended in two directions. 
First we consider moments of any order of the estimation error, and second we use generalized 
moments computed with respect to an escort distributions. This lead us to two new Cramer-Rao 
inequalities for a general parameter, together with their equality conditions. In this context, the 
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definitions of the generalized Fisher informations will pop up very naturally. For a very general 
definition of escort distributions, we also recover a general Cramer-Rao inequality given in a deep 
paper by Naudts [4]. 

In section |4j we examine the special case of a translation parameter and show, as an immediate 
consequence, that the general results enable to easily recover two Cramer-Rao inequalities that 
characterize the generalized g-Gaussian distributions. So doing, we recover the definitions of 
generalized Fisher informations previously introduced by Lutwak et al. [2] and Furuichi [3J. 
Furthermore, we show that the related Cramer-Rao inequalities, which are similar to those of 
[2] and [3J, are saturated by the generalized g-Gaussians. Finally, in section [5j we discuss some 
new estimation rules that emerge from this setting, and in particular point out a connection to the 
MLq-likelihood method that has been introduced recently, c.f. [51IS]- 

2. Definitions and main ingredients 

2.1. Generalized q-Gaussian 

The generalized Gaussian distribution is a family of distributions which includes the standard 
Gaussian as a special case. These generalized Gaussians appear in statistical physics, where they 
are the maximum entropy distributions of the nonextensive thermostatistics [7]. In this context, 
these distributions have been observed to present a significant agreement with experimental data. 
They are also analytical solutions of actual physical problems, see [EJ|9] [ID], [H], and are sometimes 
known as Barenblatt-Pattle functions, following their identification by [T2"lll3| . Let us also mention 
that the generalized Gaussians are the one dimensional versions of explicit extremal functions of 
Sobolev, log-Sobolev or Gagliardo-Nirenberg inequalities on W 1 , as was shown by [H][l5] for n > 2 
and by [H] for n > 1. 

Definition 1. Let i be a random variable on R. For a G (0, oo), 7 a real positive parameter and 
q > 1 — a, the generalized g-Gaussian with scale parameter 7 has the symmetric probability density 



where we use the notation (x), = max{a;,0}, and where ^(7) is the partition function such that 
G 7 (cc) integrates to one: 



where B(x,y) is the Beta function. 

For q > 1, the density has a compact support, while for q < 1 it is defined on the whole real 
line and behaves as a power distribution for |x| — > 00. Notice that the name generalized Gaussian 
is sometimes restricted to the case q = 1 above. In this case, the standard Gaussian is recovered 
with a = 2. 




(1) 
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2.2. Escort distributions 

The escort distributions are an essential ingredient in the nonextensive statistics context. Actually, 
the escort distributions have been introduced as an operational tool in the context of multifractals, 
c.f. [17] . |18| . with interesting connections with the standard thermodynamics. Discussion of their 
geometric properties can be found in |19II20| . Escort distributions also prove useful in source coding, 
as noticed in [3T|. They are defined as follows. 

If f(x) is a univariate probability density, then its escort distribution g{x) of order q, q > 0, is 
defined by 

provided that Golomb's "information generating function" [22] 

M q [f] = f f(x) q dx (4) 



is finite. 

Given that g(x) is the escort of f(x), we see that f(x) is itself the escort of g(x) of order 
q = l/q. 

Accordingly, the (absolute) generalized (/-moment of order p is defined by 

m P,qlf\ '■= E i l\ x \ \ = J \x\ p g(x)dx = j f ( x)qdx . ( 5 ) 

where E q [] denotes the statistical expectation with respect to the escort of order q. Of course, 
standard moments are recovered in the case q = 1. 

2.3. Deformed functions and algebra 

In Tsallis statistics, it has appeared convenient to use deformed algebra and calculus, c.f. |23[ 124] . 
The ^-exponential function is defined by 

exp q (x):=(l + (l-q)x)f* , (6) 

while its inverse function, the q-logarithm is defined by 

T }-1 _ i 

ln 9 (x) := (7) 

When q tends to 1, both quantities reduce to the standard functions exp(x) and ln(a;) respectively. 

In the following, we will use the notation q = 1/q that already appeared above in connection with 

escort distributions, and the notation q* = 2 — q that changes the quantity (1 — q*) into (q — 1), e.g. 

i 

exp Qr (x) := (1 + (q — l).!)^ 1 . We note the following expressions for the derivatives of deformed 
logarithms: 
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The g-product is a deformed version of the standard product such that standard properties of 
exponential and logarithm functions still hold for their deformed versions. The g-product is defined 

x® q y:= (x 1 -" -l) 1 -* (9) 

and gives the identities 

hi q (x ® g y) = \n q {x) + ln g (x) and exp 9 (x + y) — exp g (x) ® q exp g (y) (10) 

2.4- Fisher information 

The importance of Fisher information as a measure of the information about a parameter in a 
distribution is well known, as exemplified in estimation theory by the Cramer-Rao bound which 
provides a fundamental lower bound on the variance of an estimator. The statement of the standard 
Cramer-Rao inequality, as well as several extensions, will be given in section[3] 

It might be also useful to note that Fisher information is used as a method of inference and 
understanding in statistical physics and biology, as promoted by Frieden [551 HU • It is also used 
as a tool for characterizing complex signals or systems, with applications, e.g. in geophysics, 
in biology, in reconstruction or in signal processing. Information theoretic inequalities involving 
Fisher information have attracted lot of attention for characterizing statistical systems through 
their localization in information planes, e.g. the Fisher-Shannon information plane [271 128| or the 
Cramer-Rao information plane |29j . 

Definition 2. Let f(x; 9) denote a probability density defined over a subset X of R, and 9 £ a real 
parameter. Suppose that f(x; 9) is differentiable with respect to 9. Then, the Fisher information 
in the density / about the parameter 9 is defined as 

i 2 Am = f x (^^) 2 f( X -,o) d x. (ii) 

When 9 is the location parameter, i.e. f(x;9) = f(x — 9), the Fisher information, expressed at 
9 = 0, becomes a characteristic of the distribution: the Fisher information of the distribution: 

'^-L(^) 2f ^ (12) 

The meaning of the subscripts in the definition will appear in the following. 
3. Generalized Cramer-Rao inequalities 

In this section, we begin by recalling the context of estimation, the role of Fisher information and 
the statement of the standard Cramer-Rao theorem. Then, we show how this can be extended to 
higher moments, and to generalized moments computed with respect to an escort distribution. 

3.1. The standard Cramer-Rao inequality 

The problem of estimation, in a few words, consists in finding a function 6{x) of the data x, 
that approaches the unknown value of a characteristic parameter 9 (e.g. location, scale or shape 
parameter) of the probability density of these data. 

A standard statement of the Cramer-Rao inequality is recalled now. 
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Proposition 1. [Cramer-Rao inequality] Let f(x]ff) be a univariate probability density function 
defined over a subset X of R ; and 9 G a parameter of the density. If f(x) is continuously 
differ entiable with respect to 9, satisfies some regularity conditions that enable to interchange 
integration with respect to x and differentiation with respect to 9, then for any estimator 9{x) 
of the parameter 9, 



E 



9{x) - . 



h,i[f;B] > 



1 



8_ 

89 



E 



9(x) - 



(13) 



When the estimator is unbiased, that is if E 6{x) — 9, then the inequality reduces to 



E 



9{x) - i 



hAf] >1. 



(14) 



The estimator is said efficient if it is unbiased and saturates the inequality. This can happen 
if the probability density and the estimator satisfy J| ln/(x; 9) — k(9) (§(x) — 9 



3.2. Generalized Cramer-Rao inequalities for higher moments and escort distributions 

The Fisher information is usually defined as the second order moment of the score function, the 
derivative of the log-likelihood, but this definition can be extended to other moments, leading to a 
generalized version of the Cramer-Rao inequality. This extension, which seems not well known, can 
be traced back to Barakin [30, Corollary 5.1]. This generalized Fisher information, together with 
the extension of the Cramer-Rao inequality, has also been exhibited by Vajda |31] as a limit of a 
X a -divergence. We will recover this general Cramer-Rao inequality, as well as the standard one, 
as a particular case of our new g-Cramer-Rao inequalities. The main idea here is to compute the 
bias, or a moment of the error, with respect to an escort distribution g(x; 9) of f{x; 9) instead of 

the initial distribution. If we first consider the g-bias defined by B q (9) :— E q 9{x) — 9 , we have 
the following general statement 

Theorem 1. [Generalized q-Cramer-Rao inequality] - Let f(x; 9) be a univariate probability density 
function defined over a subset X ofR, and 9 £ a parameter of the density. Assume that f(x; 9) 
is a jointly measurable function of x and 9, is integrable with respect to x, is absolutely continuous 
with respect to 9, and that the derivative with respect to 9 is locally integrable. Assume also that 
q > and that M q [f;0] is finite. For any estimator 9(x) of 9, we have 



E 



9(x)~9 " lf,, q [f;0\* > 



9 F 



9(x)- 



(15) 



with a and (3 Holder conjugates of each other, i.e. a + f3 = 1, a > 1, and where the quantity 



Ip, q [f;e] = E 



M q [f:9] 89 \M,f.. 
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M q [f;9}~« 



E 



9 1 

89 ln «* 



f(x;9) 
M q [f;6] l « 



(16) 
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is the generalized Fisher information of order (ft, q) on the parameter 8. The equality case is obtained 
if 



q 



9 i 

-t^ ln„ 



M q [f;6]k 98 q * \M q [f; 



c{6) sign (§(x) - O) 8{x) - 8 



(17) 



with c(8) > 0. 

Observe that in the case q = 1, Mi[f;6] = 1 and the deformed logarithm reduces to the 
standard one. Immediately, we obtain the extended Barakin-Vajda Cramer-Rao inequality in the 
q = 1 case, as well as the standard Cramer-Rao inequality (|13|) when q = 1 and a = ft = 2. 

Corollary 1. [Barakin-Vajda Cramer-Rao inequality] - Under the same hypotheses as in Theorem^ 
we have 



E 



8{x) 



1 



E 



with 



ifsM'A = E 



^ln(/(z;0)) 



(18) 
(19) 



and equality if Jjg In (f(x; 8)) — c(0)sign (o(x) — 8(x) 



This inequality generalizes the standard a = 2 Cramer-Rao inequality to moments of any order 
a > 1. 



Proof, [of Theorem [T] Consider the derivative of the g-bias 



d 



c) 



08 



08 



x 



8{x)- 



M q [f;9] 



dx. 



(20) 



The regularity conditions in the statement of the theorem enable to interchange integration with 
respect to x and differentiation with respect to 8, so that 



0_ 

08 



x 



M q [f;8] 



dx 



X M q [f:8] 



(x)-8 



dx 



x 



4fM) f 6 M q [f-8] 
f(x;8) M q [f-8] 



f(x;9) 



9-1 



M q [f:t 



-f(x;8) dx, 



or, since the first term on the right is equal to -1 and since the term in bracket can be written as 
the derivative of the logarithm of the escort distribution of f(x; 8), 



1+1^(0) 



A" 



f(x;8)"\ f{x 



w-i 



M q \f-8\) M q [f-8] 



-f(x-8) dx. 



(21) 



Consider the absolute value of the integral above, which is less than the integral of the absolute 
value of the integrand. By the Holder inequality, with a > 1 and ft its Holder conjugate, we then 
have 



d_ 

08 



B q (8) 



< 



x 



8{x) 



f(x;8)dx 



x 



5_ ( f(x-,0)« \ fjx-ey- 1 

08 \M q [f;0]) M q [f;8] 



f(x;8)dx 



(22) 
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which is the generalized Cramer- Rao inequality (j!5[) . By elementary calculations, we can identify 
that the generalized Fisher information above can also be expressed as the derivative of the g»- 
logarithm, as indicated in the right side of (fl"6)) . Finally, the case of equality follows from the 
condition of equality in the Holder inequality, and from the requirement that the integrand in f|2 1 1) 
is non negative: this gives 



d 



ln„ 



M q [f;0\* 96 q *\M q [f; 



= k{6) 



6{x) 



and 



89 9 * 



f(x;9) 



which can be combined into the single condition (fTT)) . with c{9) = k{9) p > 0. 



> 0, 

(23) 
□ 



By the properties of escort distributions, we can also obtain an inequality that involves the 
g-moment of the error (§(x) — 9^j instead of the standard moment. Indeed, if g(x; 9) denotes the 

escort distribution of f(x, 9) of order q, then, as already mentioned, f(x; 9) is the escort of order q 
of g(x] 9), and 

g{x-9f 



f(x;9) = 



N q [g;( 



(24) 



with N q [g; 9] = J x g(x; 9fAx = M q [f; 9]^. 

With these notations, we see that the expectation with respect to / is the g-expectation with 
respect to g, and that the g-expectation with respect to / is simply the standard expectation with 
respect to g. On the other hand, we also have a simple property that links the deformed logarithms 
of orders g* and q: 

Proposition 2. Let b > 0, a — b q . With g* = 2 — q and q = 1/q, the following equality holds: 

ln ? (a) = grrv(&). (25) 



Proof. By direct verification. 

In particular, we note that with a = g(x; 9) 



□ 



A/„ [/;»]> Wt3 naV(3 



]n q {g) = q\n q ,(f{x-9)/M q [f-9]r 



(26) 



With these elements, the simple expression of the extended Cramer-Rao inequality (|15p in 
terms of the escort g(x; 9) of f(x, 9) yields the following corollary. 

Corollary 2. [Generalized escort- q- Cramer- Rao inequality] - Under the same hypotheses as in 
Theorem^ we have 



d_ 
d~9' 



E 



(27) 



with a and j3 Holder conjugates of each other, i.e. a 1 + /3 1 = 1, a > 1, and where the quantity 



I , q [g;9} = {N q [g-9]f E q 



(N q [g;9] 



(28) 
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is the generalized Fisher information of order (/3, q) on the parameter 9. The equality case is obtained 
if 

d 



09 



ln 5 {g(x; 9) = c(0)sign (§(x) - d) \o{x) 



(29) 



Note that this is simply a rewriting of the initial extended expression of extended Cramer- Rao 
inequality (fT5|) in terms of the escort g(x;9) of f(x,9). The generalized Fisher information Ip : q[g] 
is the same as lp,q[f',0], up to the rewriting in terms of g. The second Cramer- Rao inequality in 
(|27| is nice because it exhibits a fundamental estimation bound for a q-moment on the estimation 
error, thus making a bridge between concepts in estimation theory and the tools of nonextensive 
thermostatistics. What we learn from this result is the fact that for all estimators with a given 
bias, the best estimator that minimizes the g-moment of the error is lower bounded by the inverse 
of the (generalized) Fisher information. 

We shall also discuss in some more details the case of equality in the two Cramer-Rao inequality. 
It appears that the general solution that saturates the bounds is in the form of a deformed q- 
exponential. 

Consider the conditions of equality (|17[) and (|30[) in the two Cramer-Rao inequalities. In the 
first case, we have that the distribution which attains the bound shall satisfy 



9 1 



f(x;9) 



c{9) sign(0(x 



9{x) 



a-l 



(30) 



d9 



de 



(31) 



(32) 



M q [g-9]« 

where c(9) is a positive function. The general solution of this differential equation has the form 

f(x; 9) oc exp / c(9) sign 
Similarly, in the second Cramer-Rao inequality, we get that 

g{x; 9) cx ex P(? (j c{9) sign (§(x) - 9^ 9(x) - 9 
which is the escort of f(x; 9). 
3.3. Yet another pair of Cramer-Rao inequalities 

It is quite immediate to extend the Cramer- Rao inequalities above to an even broader context: 
let us consider a general pair of escort distributions linked by say g = (j)(f), with <f> : R + — > M. + 
monotone increasing, and / = cj)^ 1 (g) — ip(g). Denote and the corresponding expectations, 
e.g. Efj, [\x\ a ] = j x \x\ a cf> (f(x)) dx. Following the very same steps as in Theorem [TJ we readily 
arrive at 



E 



9(x) 



d_ 



E,:, 



where 



Jx 



dm/86 
f{x;0) 



dx = E 



9(x) - 9 



(33) 



(34) 
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Then, the analog of corollary^] takes the form 



9{x) - . 



de 



-E 



0(x) 



with 



J X 



dg/dd 



dx = E. 



Of) 



W (9) 



(35) 



(36) 



where the function ln^(w) is defined by ln^(u) := J" — Lrda;, and where the equality case in (|35[) 



if and only if g(x] 9) oc exp^, J Q k(9) 



6(x) 



d6, with exp^, the inverse function of ln^, . 



During the writing of this paper, we realized that a result similar to (|35p. though obtained using a 
different approach and with slightly different notations, has been given in a deep paper by Naudts 
[4|. In this interesting work, the author studied general escort distributions and introduced, in 
particular, the notion of ^-exponential families. 



4. Inequalities in the case of a translation family 



In the particular case of a translation parameter, our g-Cramer-Rao inequalities reduce to two 
interesting inequalities that characterize the g-Gaussian distributions. 

Let 8 be a location parameter, and define by f(x; 9) the family of density f(x; 9) — f(x — ff). In 
this case, we have that -^gf(x; 9) = —^-f(x—d), and the Fisher information becomes a characteristic 
of the information in the distribution. 

Let us denote by \i q the g-mean of f(x), that is of the escort distribution g(x) associated with 
f{x). We immediately have that the g-mean of f(x; 9) is (fi q +9). Thus, the estimator 9{x) — x—fi q is 
a q- unbiased estimator of 9, since E q 



will be biased, E„ 



8{x) 



= 0. Similarly, if we choose 9{x) — x, the estimator 
so that the derivative of the bias with 



= [i qi but independent of 
respect to 9 is zero. Finally, let us observe that for a translation family, the information generating 
function M q [f; 8] = M q [f] is independent of the parameter 9. 



4-1. Cramer-Rao characterizations of generalized q- Gaussian distributions 

These simple observations can be applied directly to our two Cramer-Rao inequalities (|15p and (|27|). 
This is stated in the two following corollaries. 

Corollary 3. [Generalized q-Cramer-Rao inequality] - Let f(x) be a univariate probability density 
function defined over a subset X o/M. Assume that f{x) is a measurable function of x, is integrable 
with respect to x. Assume also that q > and that M q [f] is finite. The following generalized 
Cramer-Rao inequality then holds: 



E[\x\ a ]"Ip, q [f]^>l 



(37) 



with a and j3 Holder conjugates of each other, i.e. a 1 + f3 1 = 1, a > 1, and where the quantity 



W/] = E 



M q [f] 



dx 



In (/(*)) 



M q [f] 



E 



d^* 



(/(*)) 



(38) 
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is the generalized Fisher information of order (/3, q) of the distribution. The equality case is obtained 
if 

f(x) oc exp^ (—7 \x\ a ) , with 7 > 0. (39) 

Proof. This is a direct consequence of (|T5|) . with 6(x) ~ x and 6 — 0. The case of equality is obtained 
by integration and simplifications of (|30[) . where the derivative with respect to is replaced by the 



derivative with respect to x; with ^/(x; 1 



-&/(*-»)• 



□ 



Corollary 4. [generalized escort-q- Cramer- Rao inequality] - Under the same hypotheses as in 
Theorem^ we have 



E,[\xn°I M [g]?>l 



(40) 



with a and f3 Holder conjugates of each other, i.e. a +/3 = 1, a > 1, and where the quantity 



g(x)^^- In (.9) 
da; 



= (N q [g)fE, 



^ln ? ( 9 ( 3 



(41) 



is the generalized Fisher information of order (/3, q) of the distribution g. The equality case is 
obtained if and only if 

g(x) oc exp^ (— 7 \x\ a ) , with 7 > 0. (42) 

In these two cases, the general extended Cramer-Rao inequalities lead to inequalities for the 
moments of the distribution, where the equality is achieved for a generalization of the Gaussian 
distribution. 

Let us finally note that by the same reasoning as above, the general inequalities (|33p and (|35[) 
yield 



E[\x\ a }~ E 



— hv (4>{f(x))) 



> 1 



(43) 



with equality if and only if 

g(x) = <fi(f(x)) oc exp^ (—7 \x\ a + k) , with 7 > and k a constant. (44) 

4-2. Connections with earlier results 

In the case q = 1, the characterization result in Corollary[3] has first been given by Boekee [32], who 
studied the generalized Fisher information Ip i[f] and gave a Cramer-Rao inequality saturated by 
the generalized Gaussian g{x) oc exp (—7 \x\ a ) . 

It is also important to link our findings to a result by Lutwak et al. [2J. In that remarkable 
paper, the authors defined a generalized Fisher information, which can be written as 



faAf] = E 



f{x)^-\n{f{x)) 
ax 



E 



A 

Ax 



In,* (f(x)) 



(45) 



and is similar to our Ip t q[f] in (|38|) . up to a factor f M q ^ 
Cramer-Rao inequality in the form 

E[\x\ a }"<f>e tq [f}& >E G [\x\ a }° 



. Then, they established a general 
■,JG]W, (46) 
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where G is any generalized Gaussian as in (JTJ). Actually, their result (obtained in a very different 
way), can be seen as an improved version of (|37|) . Indeed, rewriting the inequality (|37p in terms of 
<f>p,q [/]) we nave 

SN^^.J/^^MJ/]. (47) 

Then, the inequality (|46[) can be obtained by minimizing the lower bound in the right of (|47p . as is 
described in |33j . 

Similarly, the characterization result in Corollary[4| can be connected to a recent result by 
Furuichi [3l [34] in the case a = /3 = 2. In these very interesting works, the author investigated 
Cramer-Rao inequalities involving g-expectations. More precisely, he considered unnormalized 
escort distributions, that is distributions g(x) = f(x) q , and defined expectations E q [.\ as the 
expectations computed with respect to these unnormalized escort. He defined a generalized Fisher 
information which is essentially the same as our Fisher information (|4ip - although it is written 
in terms of unnormalized g-expectation. Then, he derived a Cramer-Rao inequality |MJ Theorem 
1], [31 Theorem 4.1], with its case of equality. This inequality can be recovered at once from (|40l) . 
which is rewritten below in a developed form 

It suffices to simplify the normalizations N q [g], using the fact that oT 1 + = 1 to get 



dx 



> 1. 



(48) 



Eg [\X\ 



ax 



> 1, 



(49) 



recovering Furuichi's definition of generalized Fisher information and the associated Cramer-Rao 
inequality, with equality if and only if g{x) oc exp^(— ~f\x\ a ). 

5. Further remarks 

In this section, we add some further comments on two possible estimation procedures that can be 
derived by examination of the condition of equality in the g-Cramer-Rao inequalities. 



5.1. Maximum escort likelihood 

Let us first return to the case of equality in the generalized g-Cramer-Rao inequalities. For the 
second Cramer-Rao inequality, the condition (f29[) is 



^ In, (g(x; 6)) = c(9) sign (§(x) - 0) \§(x) 



a-l 



(50) 



Thus, we see that if the bound is attained (the estimator could then be termed "g-efficient"), then 
this suggests to look for the parameter that maximizes the escort distribution of the likelihood: 



9 mel = argmax g(x;9) = argmax 



9 M q {f(x;6)Y 



(51) 
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where MEL stands for "maximum escort likelihood". Indeed, in these conditions, we have that the 
derivative in the left of (l50l) is zero, and thus that 



86 



e=e h 



3(0) sign (§(x) 



= 0. 



(52) 



> = 0A 



Therefore, we get from the equality in the right side that 9(x) = 0m el- Hence, we see that if 
it exists a (/-efficient estimator, it is the estimator defined by the maximum of the escort of the 
likelihood. Of course, we recover the standard maximum likelihood estimator in the q = 1 case. 
The analysis of the properties of this estimator will be the subject of future efforts. 



5.2. Maximum Lq-likelihood estimation 

We also saw that in case of equality, then the likelihood, or equivalently its escort, must be under 
the form of a (/-exponential: 



f(x;6) ocexp, I / c{9) sign (0{x) - 9) 9(x) 



d0 



(53) 



Actually, it seems that there is only some very particular cases where this could occur. For instance, 
if the measurements consists in a series of independent and identically distributed observations 
Xi, then f(x;9) — Tlif(xi]9) and one would have to find a distribution f(xi]9) such that the 
product f(x; 9) writes as a q*-exponential. A possible amendment to the formulation can be 
to consider a (/^-product of the densities f(xi\9) instead of the standard product, and define 
f( q *'(x;6) — §Z) q if(xi;9). Such (/-likelihood has already been considered by |35| . Here, the 
Cramer-Rao inequality still applies for the (/*-likelihood f( q *\x;9), and the equality is obtained 
if f(i*\x;0) is &q ^-exponential. By the properties (fTOj) of the (/-product, we see that the individual 
densities f(xf, 9) must be ^^-exponentials. Similarly, we see that the escort-likelihood will have the 
form of g-exponential if we use the (/-product of the escort densities: g^'{x\ 9) = 0- . g{xi\9). 

The equality condition ([29]) . applied to the q escort-likelihood g^ (x; 9) then suggests to define 
the estimator as the maximizer of the q escort-likelihood, or, equivalently, as the maximizer of the 



In, escort-likelihood: 



9\iLq — argmaxhig ( g^ q \x; 9) ] = argmax 

9 \ / 6 




(54) 



Actually, the rule defined by (|54[) has been proposed and studied in the literature. It has been 
introduced by Ferrari [51 |3B], and independently by Hasegawa [6J. As a matter of fact, the first 
authors, defining a problem as in ([54]) with data distributed according to f(x;9), showed that the 
distribution g(x; 9) must be the escort of f(x; 9). These authors have shown that ([54]) yields a robust 
estimator with a tuning parameter, q, which balances efficiency and robustness. When the number 
of data increases, then the estimator appears to be the empirical version of 



9uh q = argmin —E [\riqg(xi 



arg mm 



9 N q [g;9] (1 - q) 



g(x;6)"dx-l , (55) 



which is nothing but the normalized Tsallis entropy attached to the escort distribution g. Such links 
between maximum likelihood and the minimization of the entropy with respect to the parameter 9 
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can be traced back to Akaike in [37]. Here, this gives a direct interpretation of the MLq method as an 
approximate minimum entropy procedure, and highlights the particular role of escort distributions 
in this context. Interestingly, it is shown in [5] and [6] that the asymptotical behaviour of the 
estimator is governed by a generalized Fisher information similar to (|28|) . Our findings add the 
fact that the MLq estimator satisfies the Cramer- Rao inequality (|27f . for the product distribution 
g®(x\9). 

6. Conclusions 

The generalized g-Gaussians form an important and versatile family of probability distributions. 
These generalized g-Gaussians, which appear in physical problems as well as in functional 
inequalities, are the maximum entropy distributions associated with Tsallis or Renyi entropy. In this 
paper, we have shown that the generalized g-Gaussians are also the minimizers of extended versions 
of the Fisher information, over all distributions with a given moment, just as the standard Gaussian 
minimizes Fisher information over all distributions with a given variance. Actually, we obtain more 
precise results in the form of extended versions of the standard Cramer-Rao inequality, which 
are saturated by the generalized g-Gaussians. These Cramer-Rao inequalities, and the associated 
generalized Fisher informations, recover, put in perspective and connect earlier results by Lutwak 
et al. [2] and Furuichi [31I34]. 

As a matter of fact, these characterizations of the generalized g-Gaussians appear as simple 
consequences of more general extended Cramer-Rao inequalities obtained in the context of 
estimation theory. Indeed, considering moments of any order of the estimation error, and using 
statistical expectations with respect to an escort distribution, we have derived two general Cramer- 
Rao inequalities that still include the Barakin-Vajda as well as the standard Cramer-Rao inequality 
as particular cases. This gives rise to general definitions of generalized Fisher information, which 
reduce to the standard one as a particular case, and make sense in this context. We have also 
characterized the case of equality and shown that the lower bounds of the inequalities can be 
attained if the parametric density belongs to a ^-exponential family. Finally, we have indicated that 
these findings suggest some new estimation procedures, recovering in particular a recent Maximum 
L g -likelihood procedure. 

These results have been derived and presented in the monodimensional case. An important 
point will be to extend these results to the multidimensional case. This would be important both for 
the estimation inequalities as well as for the Cramer-Rao inequalities characterizing the generalized 
g-Gaussians. As far as the latter point is concerned, some results are already available in [33], and 
should be connected to estimation problems. As is well-known, the Weyl-Heisenberg uncertainty 
principle in statistical physics is nothing but the standard Cramer-Rao inequality for the location 
parameter. Thus it would be of particular interest to investigate on the possible meanings of the 
uncertainty relationships that could be associated to the extended Cramer-Rao inequalities. Fisher 
information, Cramer-Rao planes have been identified as useful and versatile tools for characterizing 
complex systems, see e.g. [29, 38, 39J, and it would be therefore interesting to look at the potential 
benefits of using the extended versions in such problems. An open issue is the possible convexity 
property of the generalized Fisher information. Indeed, it is known that the standard Fisher 
information, as well as the generalized versions with q = 1, are convex functions of the density. 
If this were also true for any value of q, then it would be possible to associate to the generalized 
Fisher information a statistical mechanics with the standard Legendre structure and with the q- 
Gaussian as canonical distribution. Finally, future work shall also examine the estimation rules 
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suggested by our setting and study their statistical properties. 
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